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GENEALOGIES IN EXPANDING POPULATIONS 

By Rick Durrett* and Wai-Tong (Louis) Fan^ 

Duke University and University of North Carolina at Chapel Hill 

The goal of this paper is to prove rigorous results for the behavior 
of genealogies in a one-dimensional long range biased voter model in¬ 
troduced by Hallatschek and Nelson [25]. The first step, which is 
easily accomplished using results of Mueller and Tribe [38], is to 
show that when space and time are rescaled correctly, our biased 
voter model converges to a Wright-Fisher SPDE. A simple extension 
of a result of Durrett and Restrepo [18] then shows that the dual 
branching coalescing random walk converges to a branching Brow¬ 
nian motion in which particles coalesce after an exponentially dis¬ 
tributed amount of intersection local time. Brunet et al. [8] have 
conjectured that genealogies in models of this type are described by 
the Bolthausen-Sznitman coalescent, see [39]. However, in the model 
we study there are no simultaneous coalescences. Our third and most 
significant result concerns “tracer dynamics” in which some of the 
initial particles in the biased voter model are labeled. We show that 
the joint distribution of the labeled and unlabeled particles converges 
to the solution of a system of stochastic partial differential equations. 

A new duality equation that generalizes the one Shiga [44] developed 
for the Wright-Fisher SPDE is the key to the proof of that result. 


1. Introduction. Since the 1970s the dominant viewpoint in cancer 
modeling has been that tumors evolve through clonal succession, i.e., there is 
a sequence of mutations advantageous for cell growth that sweep to fixation. 
However, recent work of Sottoriva et al. [45] has shown that in colon cancer 
most of common mutations in a tumor were present at initiation, while 
mutations that arise later are only present in small regions of the tumor. The 
long range goal of our research is to understand the behavior of genealogies 
in a growing tumor and the resulting patterns of genetic heterogeneity. This 
is important because cancer treatment may fail if one does not have an 
accurate knowledge of the mutational types present in the tumor. 

The genetic forces at work in a growing tumor are very similar to those 
in a population expanding into a new geographical area. In this case most 
of the advantageous mutation occur near the front, see Edmonds, Lille, and 
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Cvalli-Sforza [19], and simulation studies of a spatial model in [32], There are 
a half dozen papers by Oscar Hallatschek and collaborators [24, 25, 26, 33, 
36, 39] using nonrigorous arguments to study genealogies in an expanding 
population. A rigorous analysis in dimensions d > 2 seems difficult, so we 
will restrict our attention here to a one dimensional model closely related to 
one introduced by Hallatshcek and Nelson [25]. Our version of their model 
is a sequence of biased voter models on 

-^n ■= X {1, . . . , M„,}- 

There is one cell at each point of A„, whose cell-type is either 1 or 0. The cells 
in deme w G only interact with those in demes w — L~^ and w + L~^. 

Hence each cell x = {w, i) has 2M„ neighbors. Type-0 cells reproduce at rate 
2M„r„, type-1 cell at rate 2M„(r„ -|- 9R~^) where 9 G [0,oo). When repro¬ 
duction occurs the offspring replaces a neighbor chosen uniformly at random. 
In the terminology of evolutionary games, this is birth-death updating. 

1.1. Interacting particle system duality. Let Ctix) be the type of the cell 
at X at time t. Our (rescaled) biased voter model {^t)t>o can be constructed 
using two independent families of i.i.d. Poisson processes: {P^’^ : x ~ y} 
that have rate r„, and {P^’^ : x ~ y}, that have rate 9R~^. At a jump time 
of P^’^ , the cell at x is replaced by an offspring of the one at y. At a jump 
time of the cell at x is replaced by an offspring of the one at y only 
if y has cell-type 1. To avoid clutter we have suppressed the superscript n’s 
on and the two Poisson processes. 

To formally construct the process and dehne a useful dual process, we 
recall the graphical representation of the biased voter model introduced by 
Harris (1976) and developed by Griffeath (1978). The ingredients are arrows 
that spread fluid in the direction of their orientation, and (5’s which are dams 
that stop fluid. At times s of Px^y we draw an arrow from (s, y) —)• (s, x). At 
times s of Px,y we draw an arrow from (s, y) — >■ (s, x) and put a <5 at (s—, x). 
The s— indicates that the S is placed just before the head of the arrow. 
Intuitively, we inject fluid into the bottom of the graphical representation 
at the sites of the conhguration that are 1 and let it flow up. A site x is in 
state 1 at time t if and only if it can be reached by fluid. If there is fluid at y 
at time s and an arrow (with no 6) from (s, y) (s, x), the fluid will spread 
to X, i.e., there is a birth at x if it is in state 0. If x is already occupied no 
change occurs. 

If there is an arrow-d from (s,y) ^ ('S,x), then a little thought reveals 
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Fig 1. Duality for the biased voter model. 


before after 

y X y X 

10 11 

0 1 0 0 

11 11 

0 0 0 0 

In the first case the fluid spreads from y to x as if there was no 5. In the 
second, there is no fluid to be spread but the dam stops the fluid at x. In 
the third the dam stops the fluid at x, but it replaced by fluid from y. In 
the fourth, there is no fluid so nothing happens. Thus the effect in all cases 
is that X imitates y. 

To define the dual, we inject fluid at x at time t and let it flow down. 
It is again stopped by dams but now moves across arrows in the direction 
OPPOSITE to their orientation. Given z G A„ = (L“^Z) x {1,..., M„}, we 
define the dual process 0 < s < t to be the set of sites at time t — s that 
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can be reached by fluid starting at z. A little thought shows that Co ^ = {z} 
and follows the following rules: 

• If a particle in is at x and an arrival in occurs at time t — s 

then the particle jumps to y. 

• If a particle in is at x and an arrival in occurs at time t — s 

then the particle gives birth to a new particle at y. 

• If a jumping particle or an offspring lands on another particle in 
then the two particles coalesce to 1. 

For a picture see Figure 1. There the dark lines give the occupied sites in 
At the end = {—2, —1,1,3}. 

To extend the dehnition to a collection of sites A, we let 
We dehned our dual process starting at a hxed time t so that the relationship 
in Lemma 1 holds with probability 1. If we have two times t < t' then 
the distributions of d’"^ d agree up to time t, so the Kolmogorov 
extension theorem implies that we can define a process C/ for all time so 
that the distribution agrees with d’"^ up to time t. The collection of processes 
{C'^ : Ac A,i} is referred to as the dual process of 
It follows from the definition that 

Lemma 1. = 1 for some z G A if and only if Co (x) = 1 for some 

xcCt ■ 

1.2. Measure valued limit for the biased voter model. We dehne the ap¬ 
proximate density by 


-I -tyj-n 

:= -^'^Ct{w,i) 

^ i=l 

and linearly interpolate between demes to define u^{w) for all rc G M. We 
retain the superscript n on u" to be able to distinguish the approximating 
process from its limit. It is clear that for all t > 0, we have 0 < u2{w) < 1 
for all tc € M and G C'b(M), the set of bounded continuous functions on 
M. If we equip C'fe(M) with the metric 

OO 

(1) ll/ll = d(a;)l 

k=l 

i.e., uniform convergence on compact sets, then C'b(M) is Polish and the 
paths t^u'^ are C'b(M) valued and cadlag. 
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Theorem 1. Suppose that as n ^ oo, the initial eondition Uq eonverges 
in C6(E) to fo and that: 

(a) rnMn/Ll ^ a G (0, oo) 

(b) I3n = Mji/Rn ^ G [0, oo) 

(c) 7n = rn/Ln -)► 7 G [0, oo) 

(d) —>■ oo and L^Rn —^ oo 

Then the approximate density proeess {ui)t>o eonverges in distribution in 
T>([0, oo), C'b(M)), as n ^ oo, to a continuous C'b(M) valued process {ut)t>o 
which is the weak solution to the (stochastic) partial differential equation 

(2) dtu = a Au + 29 fi n(l — n) + |47 n(l — W 

with initial condition uq = ff. Here W is the space-time white noise on 
[0, oo) X M. 

Theorem 1 is a straight forward generalization of a result of Mueller and 
Tribe [38]. They considered a long range voter model on Z/n in which two 
voters are neighbors if |x — y| < Their voters change their opinion at 
rate 0{n) and imitate the opinion of a neighbor chosen at random. More 
precisely, for each of the neighbors, they adopt the opinion of that 

neighbor at rate if it is 0 and at rate + 9n~^l‘^ if it is 1. Their 
model corresponds roughly in our situation to = Rn = rn = 

Their limit is 

dtu = - Au + 29u{l — w) + |4u(l — W. 

6 

/3 = 7 = 1 while the 1/6 comes from the fact that the variance of the uniform 
distribution on [—1,1] is 1/3. 

Why is Theorem 1 true? If this discussion becomes confusing the reader 
can skip it. In Section 3 the analysis of the approximate martingale problem 
will give a more rigorous version of this explanation. 

• To begin we note that only branching events change the expected 
number of ones. They occur from each site at rate 20Mn/Rn- Birth 
events from x to y ~ x with ^t(x) = 1 and ^t(y) = 0 will change the 
local density of I’s by 1/M„L„. In an interval of length h there are 
hLnMn possibilities for x, so using the assumption Mn/Rn /? gives 
the drift term « 29 (3 u{l — u). 

• Under our assumptions, branching events are much less frequent than 
voting events {rn(Ln 7 and LnRn —^ oo imply Vn/Rn^ —>■ oo) 
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SO for the rest of the computation argument they can be ignored. 
Voting events occur from each site at rate 2r„M„. If they go from x 
to 7 / ~ X with it{x) 7 ^ ^t{y) then the local density of I’s will change by 
±1/M„L„. In an interval of length h there are hLnMn possibilities for 
X, so using the assumption rn/L-n —?■ 7 the infinitesimal variance term 
is 4777(1 — u). 

• To see the source of the Laplacian note that in the dual particles jump 
at rate 2 r„M„ by an amount that ± 1 /Ln with equal probability so the 
genealogies converge to Brownian motions with variance 2at. Recalling 
the 1/2 in the generator of Brownian motion we have the term aAu. 

1.3. Convergence of the dual to Branching Brownian motions. Durrett 
and Restrepo [18] have earlier considered a related coalescing random walk 
(with no births) on Z. We use their notation even though it conflicts slightly 
from ours. There are M haploid individuals at each site and nearest neighbor 
migration (i.e., jumps ±1 with equal probability) occur at rate v. Theorem 
2 in [18] states the following. 

Theorem 2. Sample one individual from the colony at 0 and another 
one from the colony at L. Let to be the time at which they coalesce. If L ^ 
00 and Mv/L —>■ a G (0, 00 ) then 2tol{L^ jv') converges in distribution to 
t'[]"^(ar) where lo is the local time at 0 of a standard Brownian motion started 
from 1 and r is an independent mean 1 exponential random variable. 

Why is this true? Since migration occurs at rate u, it tales time of order 
L^/z 7 to move a distance 0{L). In this time the differencesw in the location 
of the two particles will visit each integer between 0 and L of order L/u 
times. If we want coalescence to have a probability strictrly between 0 and 
1, we need Mn/L ^ a G {0, 00 ). 

The proof of Theorem 2 generalizes easily to give: 

Theorem 3. Suppose that as n ^ 00 , the conditions on Rn, Mn 
and Ln in Theorem 1 hold. Then the dual process C of f, converges in dis¬ 
tribution to a limit in which particles move according to Brownian motions 
on the real line M with variance 2at, branching occurs at rate 9(5 and two 
particles coalesce when the local time at 0 of the difference between their 
locations exceeds axj'^, where t is a mean one exponential random variable 
independent of the particle motions. 7/7 = 0 there is no coalescence. 

If we let Xt denote the spatial locations of the particles in the limit process 
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in Theorem 3, then Lemma 1 and Theorem 3 imply 
(3) E[](1 -7/4 (x)) =E n (l-uo( 2 /)). 

Here A can be a multi-set, e.g., {o, a, a, 6, 6}. In this case we start three 
particles at a and two at b and duality says 

E[(l - ut(a))3(l - utib)f] = E J] (1 - uo{y)). 

The duality relationship described in the last paragraph is not new. In 
1986, Shiga and Uchiyama [43] introduced it to study a collection of Wright- 
Fisher diffusions coupled by migrations. Shiga [44] used it to show uniqueness 
in law for (2). Doering, Mueller, and Smereka [12] gave a simpler description 
and derivation of the dual. See also Hobson and Tribe [27]. 

1.4. Tracer Dynamics. Hallatschek and Nelson [25] take an interesting 
approach to studying the ancestral lineage of a particle. That is, the path 
within the dual Csthat gives the actual ancestor at time t — s of the 
individual at x at time t. Think of our expanding population as a fluid and 
inject a small amount of red liquid at time 0. The locations of the red fluid at 
time t will identify the locations of their progeny. In particle terms, we will 
have states 0, 1, and 1* where the * indicates being labeled by the tracer, 
^(x) is the indicator of the particle at x being in states 1 or 1*. To construct 
the labeled process it is convenient to use the graphical representation. If 
there is an arrow-d from y to x then x adopts the state of y. If there is an 
arrow from y to x and y is in state 1 or 1* then x will adopt the state of y, 
but nothing happens if y is in state 0. In fluid terms, the color of fluid at y 
replaces that at x. 

The interacting particle system (IPS) dual process gives us the set of 
possible ancestors of the particle at x at time t. To analyze our new process 
with labeled and unlabeled particles, we assign an ordering to the particles 
in the IPS dual in such a way that the FIRST occupied site in the list 
will be the ancestor. The rules are as follows: 

• If particle i jumps and there is no coalescence, then its location changes 
but the order does not. 

• If particle i jumps and coalesces with j then the particle with the 
higher index is removed from the dual. The surviving particle has its 
location updated. The remaining particles are reindexed. 

• If the i particle gives birth then the new particle is labeled i, while all 
particles with indices j > i have their indices increased by 1. 
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To explain the definition, we will work through the example drawn in Figure 
1. The successive states are 


{!}, {0,1}, {0,2}, {-1,2}, {-1,3,2}, 

{0,-1, 3, 2}, {1,-1, 3, 2}, {1,-1, 3}, {1,-2,-1,3} 


To see this note that under our rules, at the point where the dual jumps 
from {1} —{0,1}, if there is a particle at 0 it will give birth and replace any 
particle at 1. The next two events involve voting, so the affected particles 
move but do not change their position in the ordering. The next event is an 
arrow from 3 to 2. A particle at 3 will replace one at 2, so 3 is inserted in 
the list before 2. The next novel event is the seventh transition when the 
particles at 2 and 1 coalesce: at that time we drop the lower ranked particle. 
For example, the actual ancestor of the particle (t, 1) is 1 if ^o(l) = 1; the 
actual ancestor is —2 if ■^o(l) = 0 and ^o(~2) = 1. 

Neuhauser [40] used a similar dual to show that in the multitype contact 
process, if the particle death rates are equal, then the one with the higher 
birth rate takes over the system. The idea of assigning an ordering to the 
particles to trace their genealogies is quite similar to that in the lookdown 
constructions of Donnelly and Kurtz [13, 14]. There each particle is initially 
assigned a level. Genealogies of any set of particles can be read off from the 
evolution of the levels. See also Etheridge and Kurtz [20] for the lookdown 
constructions for a very general class of population models. 

To do computations, we let rjt{x) = 1 if the individual at x at time t in 
the biased voter model is labeled and 0 otherwise. We only label type I’s, 
so r]t{x) < As noted above r]t{x) can be computed using the version 

of the dual in which the points are ordered. That is, T]t{x) = 1 if and 
only if the first occupied site in the list is labeled. More formally, if under 
the ordering = {yi,y 2 , ■ ■ ■ Vk} (note that K and yi,.. .yK are random 
variables), then 




K j-i 



P{r]t{x) = !) = £' 


= men 
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In the same way one computes 


( 4 ) 


= 0, . . . ^t{Xm) = 0, r]t{Xm+l) = 1, • • • , VtiXn) = 1) 


= E 


llFicn n 

_i=l 


A standard argument shows that the probabilities just computed determine 
the distribution of 

As before, we define the approximate density for the labeled particles by 

I A^tt, 

CM ■= 

i=l 


and linearly interpolate to obtain a function for all re G M. The fol¬ 

lowing is the main result of this paper. 


Theorem 4. Suppose that as n ^ oo, the conditions on Un, Rn, Mn 
and Ln in Theorem 1 hold, and that the initial condition (uq,£q) con¬ 
verges in X C'fe(M) to {fo,go)- Then the pair of approximate densities 

{uf, £f)t>o converges in distribution in L)([0, oo), C'ft(M) x C'ft(M)) to a con¬ 
tinuous X C'b(M) valued process {ut, £t)t>o which is the weak solution 

to the coupled (stochastic) partial differential equations 

dtu = aAu + 2e fu{l-u) + \A-i £{l - W° + jdy (u - £){l - M 

dt£ = a A£ -h 20 f £ (1 - u) + I4j £(1 - + \4j £ {u - £)\P^ M 

with initial condition {uq, £o) = (/o, go), where IT*, i = 0,1,2 are three 
independent space-time white noises on [0, oo) x M. 

As the proof shows the three noises W^, and IT^ refer to voting inter¬ 
actions between 1* and 0, 1 and 0, and 1* and 1 respectively. The drift in £t 
is 20/3 £(1 — u) because labeled particles only have a selective advantage in 
competition with those of type 0. 

We will prove our result by showing that the sequence of approximating 
processes is tight. To conclude that there is weak convergence, we need to 
show 


Lemma 2. When 7 > 0, the solution of the coupled SPDE in Theorem 
4 is unique in law. 
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To do this we use our duality function (4). Details are in Section 8 . 

In Theorems 1, 3 and 4, the assumption a > 0 is used crucially in the 
proof of tightness, but we allow /3 or 7 to be 0. The deterministic regime 
(7 = 0 ) occurs, for instance, if ^ and 

Rn = Mn/f3, where 2a > b > a > 0. When 7 = 0 the limiting process is a 
PDE 


dtu = a Au + 29 (3 u{l — u) 
dfi = a A£+ 2013 i{l — u). 

This PDE obviously has a unique weak solution (solve the first equation and 
then solve the second), but if one wants, this can be proved using duality. 

1.5. Lineage dynamics. Using tracer dynamics Hallatschek and Nelson 
[25], see page 163 and Appendix A, heuristically derived the probability 
density G{x,t\x',t') that an individual at x' at time t' is descended from 
an ancestor at x at time t < t'. Here we describe their argument with 
the hope that someone can find a way to make it rigorous. Making sense 
of equation (5) for the ancestral lineages seems difficult, but since these 
lineages are embedded in the limit of the branching coalescing random walk 
their behavior cannot be too pathological. 

The authors of [25] considered a more general situation in which the 
population density in a frame moving with velocity v is 

dtu{x, t) = Dd1u{x, t) + vdxu{x, t) + K{x,t) 

where, for instance, K{x,t) = su{l — u) + e-sju{l — u)Z with Z being a 
space-time White noise, and we have changed their c to u. They concluded, 
see their (3), that 

dtG{x,t\x' ,t') = —dxJ{x,t\x',t') 

J{x, t\x', t') = —DdxG -|- {u -|- 2Ddx log[M(x, t)]}G. 

Here {x'R') is thought of as the initial condition and (x,t) as the finial 
condition, so this is the forward equation 

(5) dtG = DdlG - dx{{v + 2Ddx\og[u{xR)]}G) 

and the drift in the diffusion process is u -|- 2Ddx log[c(x,f)]. 

As we will now show, a closely related equation “follows” from Theorem 
4. We used quotation marks because we use the nonexistent Ito’s formula 
for the SPDE in Theorem 4 and apply it to the function f{l,u) = Iju which 
is not continuous at (0,0). 
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Suppose {u, 1) solves the SPDE and let p = i/u. By Theorem 4, 
udti — idtu 1 


dtp = 




U L 


+ |47£(u-£)|^/2 ^2 


aM + 2e + | 47^(1 

a Au + 2913 u(l — u) 


+ 147^(1 - iy° + |47 (u jyi 


The terms involving 9/3 cancel. To combine the Laplacian terms we use 
the formula 

\uj u u \u 


To add up the noises we note that since the VP® are independent, the vari¬ 
ances add up to 


|4 niZ 




i{u - 1) 


u — f)(l — u) ^ £(1 — u) 








l{u -1) 




since {u — £)(1 — u) + £{1 — u) + v? = tt(l — u) + v? = u. Combining our 
calculations, 

(6) dtp = aAp + 2adx\ogu ■ dxP + |47p(l — p)/u\^^‘^ W 

for some white noise VP. To compare ( 6 ) with (5), note that their D = a, 
they work in a moving reference frame and their equation is for a fixed 
realization of the total population size; while ours is in a fixed reference 
frame and does not condition on u{t, x) and hence retains the fluctuation 
term |47p (1 — p)/u\^/‘^ VP. 

Equations (5) and ( 6 ) both contain drift terms of the form 5a; log[u(x, t)]. 
It is a well-known fact that solutions of the SPDE in (2) are Holder contin¬ 
uous with exponent 1/2 — e in space and 1/4 — e in time, so it is not clear 
how to make sense of these equations. The fact that pt{x) < it{x) means 
that £< u, so in computing £ju we will never divide a positive number by 
0. However solutions to u have compact support [37], so it is not clear if the 
ratio of densities £+ /'u+ will be tight. 


1.6. Concluding remarks. Theorem 1 is a special case of Theorem 4. The¬ 
orem 3 is proved in Section 2. The rest of the paper is devoted to the proof 
for Theorem 4. In Section 3 we introduce stochastic integral representations 
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for and formulate approximate martingale problems. This is now 

a common approach in the study of scaling limits of particle systems, see 
[17, 11, 16]. The calculations for the u equation are almost identical to those 
in Section 3 of Mueller and Tribe [38], but some minor changes are needed 
to study the joint distribution 

In Sections 5-7 we prove tightness. Again many of the ideas come from 
[38], but since they only write out the details for their contact process limit 
theorem, and we have to prove the joint convergence, we have written out 
the details. See Kleim [31] for a recent example of convergence of rescaled 
Lotka-Volterra models to a one-dimensional SPDE, this time with a cubic 
drift term. The main ideas of the tightness proof are given in Section 5. 
Two lemmas that require a lot of computation are proved in Section 6. 
Some nonstandard random walk estimates are proved in Section 7. Finally 
Lemma 2, which establishes distributional uniqueness for the coupled SPDE 
by using a duality based on (4), is proved in Section 8. 

Acknowledgement. We would like to thank Carl Mueller and Edwin 
Perkins for their help while we were writing this paper. Two referees made 
many detailed comments that improved the readability (and correctness) of 
this paper. 

2. Proof of Theorem 3. To get started suppose that there are no 
births in the dual and consider the special case in which there are initially 
two particles. Let be a random walk that jumps from x to x ± 1/L„ at 
rate 2r„M„. The reader should think of this difference of the location of two 
particles (hence the factor of 2 in 2r„M„), but we allow the two lineages to 
move independently even after they hit. An elementary computation shows 
that if 



ISf I — is martingale. As t —oo, jS"”] converges to the absolute value of 

a Brownian motion Bt with variance Aat. 


The next result has been proved for finite variance random walks by 


Borodin [4] in 1981. To keep this paper self-contained, we will give a simple 
proof for the nearest neighbor case. 

Lemma 3. converges in distribution to io{t) the local time at 0 for 
the Brownian motion Bt. 


Proof. An easy computation for simple random walk shows that for any 
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stopping time 

-Vs< EoVr < cVt 

so by Aldous criterion (see e.g., Theorem 4.5 on page 320 of Jacod and 
Shiryaev [28] the sequence is tight. Let be a convergent subsequence 
with limit Vt- — is a martingale. Using the maximal inequality on 

the random walk, and the dominated convergence theorem on the increasing 
process, both processes converge to their limits in L^. Since conditional 
expectation is a contraction in L^, it follows that \Bt\ — Vt is a martingale. 
Now io{s) is the increasing process associated with \Bt\ See e.g., (11.2) on 
page 84 of Durrett’s Stochastic Calculus book [15]. By the uniqueness of the 
Doob- Meyer decomposition, there is only one subsequential limit, so the 
entire sequence converges in distribution to ■^o(^)- Cl 

The sojurn times at 0 are independent and exponential with rate 4r„M„, 
so the number of visits to 0 up to time t 

N]} ~ 4rnMn [ l(s-= 0 ) ds 

Jo 

and it follows that N^/Ln ^o{t)- On each visit to 0, the two particles 
have a probability 1/M„ to coalesce. Our assumptions imply that 

Mn f n djji y 

—— • —— —y Oi so ———)■ — 

-^n ^ 

and the desired result has been proved for the processes with no births. To 
add births we note there are cells at each deme and Pxy has rate 0R~^, 
so the branching rate at each deme is 6Mn —>■ Ofi. 

3. Approximate Martingale Problems. For simplicity we drop the 
subscript n’s on M, L, R and r. We leave the superscript n in u” and to 
distinguish the approximating processes from their limits. We write 

{f^g)-=^ f{w)g{w) 

w£L~^'E 

whenever (j/j, j^j) < oo and adopt the convention that := 4>{w) when 

X = (tc, i). 
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3.1. Type 1 particles. To develop the martingale problem, we note that 
dynamics of can be described by the equation 

(7) &(!•) = {(,(1) + 

yr-^X 

yr^X ^ 

In the first integral, if Cs-(2/) = then nothing happens. 

If is-{y) = 1 and is-{x) = 0 then = 1; 
if ^s-{y) = 0 and = 1 then ^s(x) = 0. 

In the second integral, nothing happens unless is-{v) = 1 and Cs-{x) = 0. 
In this case ^six) = 1. 

Let (j) : [0, oo) x M —)• M be continuously differentiable in t, twice continu¬ 
ously differentiable in x and have compact support in [0,T] x M for any T. 
Applying integration by parts to ^t{x)4>t{x), using (7), and summing over x, 
we obtain for all t G [O.T], 

{ut, 4>t)-{uQ, (/)o) - [ {Us,ds4>s)ds 

Jo 

( 8 ) = f\^s-{y)-Cs-ix))Mx)dPty 

X y^x 

(9) + (ML)-i 6-(2/)(1 - is-{x))(t)s{x) dP^’y. 

X y^x 

Drift term. We break (9) into an average term and a fluctuation term 

(10) (ML)-^ ^s-{y){'i--^s-{x))(l)s{x)9R ^ ds 

X y'^x 

( 11 ) +(ML)-i seT 6-(y)(l - 6-(x))<(>s(x) (dPf’" - 9R-^ ds). 

Recalling the definition of the density, (10) becomes 
M 1 P 

^ / K-(«^-^"') + <-(tc + L-')](l-<_(^))<().(a^)ds 
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Since M/R ^ (3, this converges to 

0(3 / 2us{w){l — Us{w))(f>s{w) dw ds 

Jo Jr 

as n —>■ oo. Here, and in what follows, the claimed convergences follow once 
we have proved C-tightness (see Section 5 for the proof). We have established 
convergence of finite dimensional distributions so the sequences of processes 
converge in distribution. 

The second term (11) is a martingale {4>) with 

< —— / (1, ((>1) ds ^ 0 since LR —)• oo. 

LR Jo 

White noises. We can rewrite ( 8 ) as 

( 12 ) (ML)"^ { 6 - {y) [1 - 6 - {x)]-Cs-(x) [1 -^s- {y)]}4’s (x) dPf 

X y^x 

We now rewrite the integrand as 

(13) 6-(2/)[i - is-{x)](l)s{y) - ^s-{x)[i - ^s-iyMsix) 

(14) + 6-(y)[l - 6 -(a^)](<(>s(a;) - My))- 

We work first with (13). Interchanging the roles of x and y in the double 
sum, letting ^^{z) = 1 — ^t(z), and writing — Ps’^ this part of 

( 12 ) becomes 

(15) (ML)-i C-iy)Mix)(j)s{x) dQt’^. 

X y-^x ^ ^ 

To prepare for treating the joint SPDE we split (15) into Z^{(f)) + ZlM 
where the Zl((j)) are defined by the next two equations 

( 16 ) {MLMY.Y. f es-{y)ys-{x)Mx)dQr, 

(17) iMLMY.Il f es-{y){is-ix)-ys-ix))Mx)dQr- 

X y^x ^ 
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These two martingale terms use the same Poisson processes but the product 
of their integrands is 0 since (1 — ^t{x))rit{x) vanishes for all x and t. Hence 
these martingales are uncorrelated. 

The variance process = 2rt. Our assumptions on (p imply that 

suPsg[o,i] \4>six) — 4>s{y)\ < Kt/L for some Kt < oo, so ignoring the difference 
between 4>s{x) and 4>s{y), we have 

{Z°{(j)))t = 4rL-^ f - u^_{w))(j)siwf ds + o{l) 

weL-^z 


which converges to 

47 / / is{w){l — Us{w))4>si'w)‘^ dw ds since r/L —>• 7 . 

Jo Jr 

Similarly, {Z^{(j)))t converges to 

47 /" f {us{w) - £s{w)) - Us{w))(l)siwf dwds. 

Jo Jr 

Laplacian Term. We denote the discrete gradient and the discrete 
Laplacian respectively by 


(18) VLfiw):=L(^f{w + L^)-f{w)^ 

(19) AlHw) := (^f{w + L~^) + f{w - L~^) - 2f{w)^ . 


We break (14) into an average term and a fluctuation term 

(20) (ML)”i ^s-{y)^'s-{xMs{x) -(l)s{y)]rds 

X y^x 

(21) + (ML)-^ SEf ^s-{y)PJs-{x)[(t)s{y) - (t)s{x)]{dP^'y -rds). 

X y^x^^ 

We can replace by 1 in (20) without changing its value, because by 
symmetry, T.xT.yr.x^s-{y)is-{x)[(t)s{x) - (ps{y)] = 0 for all s > 0 . 

Doing the double sum over y and then over x ^ y the above is 
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By assumption rM/L^ a, so this term converges to a The 

other term, ( 21 ), is a martingale with 

(22) {£C)(^,)),< " /■' ((/>s(x) - (j)siy)fds 

^ ' X y'^x ^ 

2r 

(23) =J^ j^{l,\S/L(t>s?)ds^^ 
since rjL —)• 7 and L —)• 00 . 

Combining our calculations, we see that in the limit n —?> 00 , 


(24) 


/ ut{w) 4>t{w) dw — uo{w) 4>o{w) dw — / / Us{w) ds4>s{w) dw ds 

1/ M J 0 'J M. 

— / Q;tts(tc)A0s(rc) — 20/3us(r(;)(l — tts(rc)) ())s(t(;) dtc ds 

do dR 


is a martingale with quadratic variation 
(25) 


47 / / tts(rc)(l — Us(t(;))(?!)^(t(;) dtc ds 

do dR 

which is the martingale problem formulation of (2). 


3.2. Labeled Particles. To begin, we note that in terms of the previously 
defined Poisson processes, 

(26) r]t{x) - r]o{x) = [ ^ds-iv) - Vs-ix)) dP^’^ 

+ ds-{y){^ - ys-{x)) - 6-(y)(i - ys-{y))ys-{x) dP^’^. 

y^x ^ 

The first term gives the voter interactions. For the second term, note that 
if y is in state 1* and x is not, the number of l*’s will increase by 1 , while if 
X is in state 1* and y is in state 1 {^s-{y) = 1 and rjs-iy) = 0), the number 
will decrease by 1 . 

Arguing as for the type 1 particles while using (26) instead of (7), we get 

do 

(27) = (ML)-^ SEf Pls-{y) - r]s-{x))4>s{x) dPpy 

X y'-^x 

+ (ML)-^ [vs-(y) - 6 - {y)vs-{xMs (x) dP ^’^, 

X y'^x 


( 28 ) 
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where we have simplified the second term of (26) using ^s-{y)Vs-iy) = 
rjs-iy)- 

Drift term. Breaking the second term (28) into an average term and a 
fluctuation term as before, we conclude that as n ^ oo, the average term is 


OM 

Tr 


f[C{w + L-'} 


u^{w + L-^)i^{w) 


+ i^{w — L ^) — u^{w — L ^)i'^{w)]4>s{w) ds 


which converges to 

op f f 2ls{w){l — Us{w))(pa{w) dw ds. 

Jo Jr 

White noises. We again change the integrand in (27) to 


{Vs-{y)[l - Vs-{x)] - Vs-{x)[l - 7]s-{y)]} Psix) 

and then split it into two parts as in (13) and (14). That is, we rewrite the 
integrand as 


(29) Vs-{y)[^ - ys-ix)]r]siy) - Vs-ix)[l - r]s-iy)]psix) 

(30) + r]s-iy)[l - r]s-ix)]iPsix) - psiy))- 


Arguing as before, we obtain the following sum coming from (29). 

(31) (ML)-^ [1 - Cs-iy)]r]s-ix)psix) (dP|’^ - dPp’y) 

X y'^x 

[6-(y) - ris-iy)]ys-ix)Psix) (dP^'^ - dPp’^). 


The first noise is the same as Z^ip) in (16) while the second noise, denoted 
by Z^ip), has variance converging to 


47 /" [ isiw) (usiw) - £siw)) psiw)"^ dw ds. 
Jo Jr 


The product of any two of the three integrands in ZHp) {i = 0,1, 2) is 0, so 
these three martingales are uncorrelated. 

Laplacian term. Breaking (30) into an average term and a fluctuation 
term as was done for (14), we see that as n —)• 00 , the average term 
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Combininff our calculations, we see that in the limit, for any 6, ih ^ 
Ce'’"([0,oo) xM), 


(32) 


/ {ut 4>t -uo4'o + it tpt - 4 V’o) dw 

Jr 

-a / Usids(l)s + + is{dsits + A'iljs)dwds 

Jo Jr 

-26 P Us {I - Us) 4>s + is i'i- - Us) P’s dw ds 

Jo Jr 


is a continuous martingale with quadratic variation 

(33) 47 / f Us{l-Us)(l)i + is{l-is)Ps + ‘^PsPsis{'i-Us)dwds. 

Jo Jr 

Any sub-sequential limit {u, i) solves this martingale problem. It is standard 
(see p. 536-537 in [38]) to show that {u, i) then solves the coupled SPDE in 
Theorem 4 weakly, with respect to some white noises. 


4. Green’s function representation. As remarked earlier our proof 
follows the approach in [38]. The hrst step is to prove the analogue of their 
( 2 . 11 ). Observe that u2{z) = (tt”, L 1 ^) and ^f(z) = (f^ Tlz), where is 
the function on which is 1 at z and zero elsewhere. Let 
which converges to a as n —)• 00 and let 


(34) p^w) :=LF{Xp = w\X^ = 0) 

be the transition probability of the simple random walk (Aj"'^)t>o on L~^'L 
with jump rate 2LP, so that it converges to pt{w) the transition density of 
Brownian motion run at rate 2. Let {Pp}t>o be the associated semigroup 
which has generator the discrete Laplacian dehned in (19). 

Applying the approximate martingale problems with test function 


(35) 


psiw) := 4>Y{w) : = 


\pl^{t-s)i'^ - ^) fors€[0,t], 
I 0 otherwise 


and using the facts that dsps + ctnAips = 0 and {uq, = P^^iUq{z), we 
have 

(36) <(z) = + YtiP) + ZPP) + e[^\ 4>) + Ef\p) 
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for t > 0 and 2 ; G L~^Z. Here Zt{(t)), and E^‘^\(j)) are martingales de¬ 

fined in (15), (21) and (11) respectively, with (/>s defined in (35). To describe 
the other term, we let /?„ = MR~^ which converges to /3, and let Yt{(f>) be 


(37) 



+ u^(w + L-^)]{l 


u^{w))(f)s{w)ds. 


Repeating the last argument for the labeled particles, 

(38) i^iz) = PZMz) + Y/icP) + (</.). 

Here zf\4>) := Z^{4>) + Z^{(f)) is given by (31), e\^’‘^\(I)) is obtained by 
replacing by (r/, tj'^) in (21), Ef’‘^\(j)) is the fluctuation term corre¬ 

sponding to (28). The remaining term is 


i'/w = ^r E [cu + i-‘)-<u + i-‘)C(to) 


w£L~^1i 


+ ig{w — L ^) — u^{w — L (j)s{w)ds. 




5 . Tightness. Recall that a sequence of probability measures is said to 
be C-tight, if it is tight in D and any subsequential limit has a continuous 
version. The goal of this section is to prove: 


Theorem 5. Suppose the assumptions in Theorem 1 hold. Then the 
sequence {(«"■, T^)}n>i is C-tight in Z1([0,T], C'b(M) x Cfc(M)) for every T > 

0 . 


Proof. The compact containment condition (condition (a) of Theorem 
7.2 in [21, Chapter 3]) holds trivially as 0 < < 1. Since C'f,(M) xC'fc(M) 

is equipped with the product metric, the desired C-tightness follows once 
we can show that for any e > 0, one has 


(39) 

lim lim sup P 


n—)-oo 

(40) 

lim lim sup P 


n—)-oo 


( sup 
V t\—t 2 <s 

0<t2<ti<T 





= 0 , 


{ sup =0. 

V tl—t2<S / 

0<t2<ti<T 


Here and in what follows the norm is the one defined in (1). It is enough 
to show that (39) holds with replaced by any term in the decomposition 
given in (36), and that (40) holds with replaced by any term in (38). 
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First term in (36) and (38). By standard coupling arguments for simple 
random walk, we can check as in Lemma 7(b) of [38] that, upon linearly 
interpolating PJ^^^Uq{z) in space, we have 

(41) sup ||-P"^tUo - P„„t/o||-t 0 as n-5-00, 
te[o,r] 

where {Pt}t>o is the semigroup for the Brownian motion in M running at rate 
2, and /o is the initial condition for u functions in Theorem 4. This implies, 
by the continuity of the semigroup P*, that (39) holds with replaced by 
P^^iUq. By the same reasoning, we have 

(42) sup ||Pa„i^o - Pa„t£/oll-t 0 asn-^oo 
te[o,r] 

where qq is the initial condition for ^ functions in Theorem 4. 

Remaining terms in (36) and (38). For simplicity, we write 
u^{z) := Yt{<P) + Zt{(^) + PfV) + 

r,{z) := vPict^) + Zf\<p) + Pf ')(,/.) + Pf’") (</.). 

The next moment estimate for space and time increments is similar to 
Lemma 6 in [38], but ours implies Holder continuity of the limits with ex¬ 
ponent < 1/2 in space and < 1/4 in time. 

Lemma 4. For any p > 2 and T > 0, there exists a constant C{T,p) > 0 
such that 

(43) E\ul[{z,) - v^^{z2W < Ct,p (iti - + Ip - P]^/' + M-P) 

(44) E\il{zi) - Il{z2W < Ct,p (iti - + Ip - P]^/' + M-P) 

for all 0 < t 2 < ti < T, zi, Z 2 ^ P“^Z and n> 1. 

The proof of this result is postponed to the next section since it requires a 
number of computations. We now argue that (43) implies (39) holds for vP. 
This idea is described in the paragraph before Lemma 7 in [38] and page 648 
of [31]: we approximate the cadlag process vP by a continuous process u and 
invoke a tightness criterion inspired by Kolmogorov’s continuity theorem. 

Lemma 5. Define vP G (^([Ojoo), C'fc(M)) by ut = ut on the grid t G 
On Z+ and then linearly interpolate in t for each tc G M. Suppose On > M~‘^ 
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and lim^^oo = 0. Then there exists no € N such that for any p > 2, T > 0 
and K > 0, 

(45) - uf^iz2W < CT,p,K(\ti - t2\P^‘^ + |2;i - 2 : 21 ^/^), 

for all 0 < t 2 < ti < T, Zi & M with \zi\ < K (i = 1,2) and n > uq. 

By a standard argument (see, for example, Problems 2.2.9 and 2.4.11 of 
[29]), one can show that (45) implies that (39) holds when u is replaced by 
vT. Finally, by the reasoning in the proof of Lemma 7(a) in [38], there is a 
cr > 0 so that 

limsupPf sup \\uf — uf\\>n~^'\ =0. 

n=foa Me[0,r] ' 

Therefore (39) holds for vT. By the same argument, (44) implies (40) holds 
for £”■. Theorem 5 will be proved once Lemmas 4 and 5 are. □ 

6. Proofs of Lemmas 4 and 5. 


Proof of Lemma 4. We prove only (43) for unlabeled particles. The 
proof of (44) for labeled particles is similar. The basic ingredients are the 
following estimates of time and space increments of the transition probability 
of simple random walk. Namely, there exists a constant C > 0 independent 
of n such that 


(46) 

(47) 

(48) 

(49) 

(50) 


0< / p^iO)ds<CVT, 

Jo 

poo 

0< / p”(0)-p”+0(O)(is < Cv^, 

poo 

0< / pm-Pl{z)ds<C\z\ 

f < CVT~e, 

w 

- p'^(^ + '^)\ds < CVt\ 


for 0 > 0, z G L~^'L and T > 0. These estimates can be either found or 
deduced from the standard methods described in Chapter 2 in [35]. For 
completeness, we give precise references and missing details in the next sec¬ 
tion. 
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We will show that each of the four terms of satisfies (43). To simplify 
notation, we assume, without loss of generality for the proof, that = 1 . 
First, we deal with the process Y that has no jumps. To reduce the size of 
the formulas we let 


v^{w) = K(u; - L-i) + u^{w + L-i)](l - <(rc)). 

Using the definitions of Y (37) and of our test function (35) we have for 
ti > t2 

= OPn r jY^VsHpl-sizi - w) ds 
+ 0Pn [ -Pt^_s{z2-w)]ds 

= 9(3niei{Y) + e2{Y)). 

The sums are over w G L“^Z, which we have omitted to simplify notation. 
Since 0 < «"■ < 1 and L~^ YlwP?i^ — w) = 1 (recall the definition of p” in 
(34)), we have 

(51) E\e,{Y)\P <2P{ti-t2Y. 


by Holder’s inequality. 

By the triangle inequality and the translation invariance and symmetry 
of the transition density, we obtain 

[ -Pt2-s{z2-w)\ds 

< [ {\Pu-sizi - Z2 + w) - p2^_^{w)\ + \p'^^_^{w) - p'l^_Yw)\) ds 

< Ct\zi — Z2\ + Cy/tl — t2 

by (50) and (49). Since 0 < < 2, 

(52) E[|02 (y)r] <CT,p{vtr^2 + \zi-z2\Y 

for all 0 < t 2 < U < T, -^ 1 , ^2 U L~^'L and n > 1. 

It remains to consider Zt{ 4 >). Note that for each of 

them, the largest possible jump is bounded almost surely by 

2{ML)~^ sup \\4>s\\oo < 2M“\ 

s>0 
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because cps, as defined in (35), is bounded by L. 

We shall employ a version of the Burkholder-Davis-Gundy inequality 
stated at the bottom of page 527 in [38]. Namely, for any cadlag martin¬ 
gale X with Xq = 0 and for p > 2, 


(53) E[ sup |X,|P] < C{p)e\ + sup |X, - Xs-f 

sG[0,t] ^ sS[0,t] 


, t > 0. 


Writing for the compensated Poisson process — VnS, and Ylxy-^x 
for the double sum Yhyr^x-: decompose 

= mT, I" (pI-M - y) - - x)) dNpy 


x^y^x 


+ {p7,-s{zi - y)-Pt,-s(.zi - x) 

- Pt 2 -s(z 2 - y) +Pt 2 -s{z 2 - x)) dN^’y 


x^y^x 


= 0i(P«) + 02(p(^)). 

Writing for the compensated Poisson process — 9R~^s we have 

_ p( 2 )^^^ 2 , 22 ^ 




1 

Wl 


rti 


T. Hr^-T. ^2 


x^y^x 


+m S r{.-fa)(i-6-(x))K-.(^i-L-pS-.(22-i^)l<ijvf» 

x^y^x 

= 01(P(2)) + 02(P(2)). 

Finally, writing Qs'^ = Ng'^ — Ng’^ we have 
^t2{ 




x^y^x 


't2 


1 




+ li7I E / (l-6-fe))6-W[f>?.-,(2i-a:)-p,“_.(22-x)|d«-» 

O' 'llr^'T " 0 


x^y^x 

= Qi{Z) + 02(Z). 




GENEALOGIES IN EXPANDING POPULATIONS 


25 


Once we use 0 < ^ < 1 to simplify the integrands the three expressions 
have a similar structure. will be the smallest since it has a difference 
of transition densities at adjacent sites, so we begin by estimating To 
estimate we let 




1 

Wl 


X,yr^X ^ 


By Markov property of {^t)t>o and the stationarity of the compensated Pois¬ 
son process, 

where E ^‘2 is the expectation w.r.t. the law of ^ starting at ^t 2 - 

To prepare for the next calculation we note that using the symmetry of 
the transition density and the Chapman-Kolmogorov equation 


(54) = i-‘ EU(t»)K(-a>) = P2«(0). 

w w 

The predictable bracket process of is 

Since there are 2M values of y for each x and M values of x for each 
w G if we let c„ = 26 /then the above is 

<Cn [ L-^ Y1 {PH-t2-s{w)f ds 

do 

= Cn ^ P2(t,-t2-s)i^) ds<CnC {h - 

for t G [0, ti — t2], where in the second step we have used the (54) and in 
the last step we used (46), Note that c„ ^ 0 since RnLn oo. 

If we do the calculation for Z then Cn = 2rnlL —)• 27 so we get the same 
upper bound. In E^^\ Cn = TnjLn —)• 7 but we have p\_s{zi — y) —— 
x) instead of a single p, so 

d-~^^[pli-t 2 -s{w) - Pl^_t2-s{w - L-^)]^ 

W 
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If we use (50) now we would get an upper bound of CtL~^ when we integrate 
the above expression with respect to ds from s = 0 to s = T. Therefore, by 
(53), we have 

(55) 

for and Z. 

Similarly, IE(|02(£i(2))|?’) = ¥.{\X‘i^\P), where 

•’^<" = 171 E f i.-(v)iUx)\i>l-X^i-x)-pl_Xz2-x)\dNp« 

is a cMlag martingale for t <t 2 with predictable bracket process 

<Cn [ L-^ Y^lPti-sizi -w)- Pt^_siz2 - w)f. 

•^0 

Arguing as before using (54) we get 

< P 2 (ti-.)( 0 ) + P2{t2-s)(.^) - ‘^Pt2+U-2sizi - Z2) 

a result that also holds for and Z. Adding and subtracting 2p^^_^_^^_2g{0) 
and using (47) and (48) the above is at most 

Cy/h — t2 + Ct\zi — Z2\)- 

Using (53) now, we have 

(56) IE(|02r) < Cp,T{\tl - + kl - Z2\P/^ + M-P) 

which holds for E ^^'^, and Z and the proof is complete. □ 


Proof of Lemma 5. It suffices to consider the case |ti — < M~^ 

and l^i — ^ 2 !^^^ < M~^, since otherwise Lemma 4 easily implies (45). The 
triangle inequality gives 


(57) \uliz,) - UI{Z 2 )\ < \uliz,) - + \ul{z,) - UI{Z 2 )\. 


We first estimate the time difference on the right. Write Sk ■= kOn for k G Z+. 
Since |ti — t 2 | < < On, we have either Sk < t 2 < ti < Sk+i for some k 

or t 2 < Sk < ti for some k. Since lE is linear between grid points, in either 
case 


KM - < 2(|®L.(2) - KM\ V KM - U._.(2)l) 


|U — ^2! 
On 
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Hence Lemma 4, the assumption M ^ < 6^ and \ti — 12 \ < M ^ imply that 


nuliz) - ul{z)\P < C't,^ + M-P) 


< Cta 


Bl 

t/r). 


Next, we estimate the space difference on the right of (57). Take n large 
so that M~^ < (1 + l)L~^ and (1 + 7 )^L“^ < L~^ (this is possible by our 
assumptions on scalings, even if 7 = 0). Then \zi — Z 2 \ < (1 + 7 )^L“^ < L~^. 
By almost the same argument used above, we easily obtain 


E\u^izi) - ur(^2)r < C{T,P) 121 - Z2\pLP{L-p/^ + M-P) 
<C{T,p)\zi-Z2\P{LP/^ + {l + ^r) 

< C{T,p) \ (1 + 7)p/2 |21 - 22 !="^/^ + (1 + ki - 


^2 


The proof of Lemma 5 is complete. 


□ 


7. Random walk estimates. The first two, (46) and (47), follow di¬ 
rectly from the local central limit theorem (LCLT) (see, for example. Propo¬ 
sition 2.5.6 in [35]). (49) follows from (50) and the Chapman-Kolmogorov 
equation: the integrand can be written as 

pl+eH-pliw) = j Y 1 Pnz){p^iw-z)-p^iw)). 

It remains to prove (48) and (50). 

By scaling, Pt{w) = Lpi 2 ^{Lw) where pt{k) is the transition density of 
the simple random walk on Z. The integral of (48) is therefore 

^l\m-piiLz)ds. 

Splitting this integral into two parts according to whether s < L\z\‘^ or s > 
L| 2 p, the first part is bounded by L~^ P”(0) ds < C\z\/y/L according 
to (46). The second part is bounded by C\z\ by the LCLT. 

Formula (50) is similar to Proposition 2.4.1 in [35] which says that 

V \qm{k) - qm{k+j)\ < ^L, 


(58) 
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where qm{k) is the transition density for the discrete time simple random 
walk on Z. Hence, by using scaling and an independent Poisson process Nt, 
we rewrite the left hand side of (50) as 


■4 r 


1 

I 2 


EIE ipw 

fcgZ m=0 


= m) {qmik) - qm{k + Lz)) \ ds 


which is at most 


C\z 

~17 



F{Ns = m) 
y/m 


ds 


by (58). Arguing as in the proof of Proposition 2.5.6 in [35] by using Propo¬ 
sition 2.5.5 (the LCLT for Poisson processes), we obtain that the integral is 
of order VL‘^T and hence (50) holds. 


8. Proof of Lemma 2. To prepare for the proof for the SPDE, we 
begin by considering the diffusion process 

(59) dU = /3C/(1 -U)dt + a^U{l - U) dB. 

Following the approach of Doering, Mueller and Smereka [12], we change 
variables Z = 1 — U to get (recall dZ = —dU) 

(60) dZ = -(3Z{1 -Z)dt- a^Z(l - Z) dB. 


The minus in front of the diffusion term is not important here but it will 
be in the next calculation in (63). Using Ito’s formula and ignoring the 
martingale terms 


drift(Z™) = mZ^-\-^Z{l - Z)) + m{m - 

= (SmlZ-'+i - Z-l + 


2 

l)Z™“2^Z(l - Z) 
[ Z "*-1 - Z ”^]). 


Let N{t) be a Markov process with Q matrix 


Q 


m,m+l 


Q 


7n,m — 


= l3m 

—/3m — 


Qm,m—1 

m{m — 


r,mim — 1 ) 
“ ^ 2 
1 ) 


(61) 


2 
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Combining our calculations, 

j^EZ^ = Y,Qm,nEZ^ 

n 

Letting Pe,m{t) = ^{N{t) = m\N{0) = i) he the transition probabilities, 
we have 


Lemma 6. For fixed T > 0 and d >1, Mt = Ylm=i Ee,m{T — t)Z'^{t) is 
a martingale. 

Proof. Differentiating we have 


j^EMt = Y, EZ^{t)j^Pe,m{T -t) + Pe,m{T - t)j^EZ^{t) 

m 

= Y -EZ^{t) Y PhniT - t)Qn,m 

m n 

+ Pe,m{T — t) Y^ Qm,nEZ'^{t) = 0 


if we interchange the roles of m and n in the second sum. □ 

From Lemma 6 we get 

(62) EZ^^^\T) = EZ^^Efi)fi 
Now consider the system 

(63) dZ = -fiZ{l- Z)dt-aVvZdB^ 

dV = fiVZ dt + aWZdB^ + ay/V{l -V -Z)dB'^, 

where the B^ are independent Brownian motions. To get our second dual 
function, we consider Yn = J2m=i Using Ito’s formula and for the 

second term recall (60), 


\m=l 


Y Z^-^V j = u ^ (m - l)Z^-^{-(3Z{l - Z)) 

/ m =2 

2 ^ 

+ —V ^ (m - l)(m - 2)Z^-^Z{1 - Z) 

n n 

+ Y Z'^-^fiZV - CT^ ^ (m - l)Z'^-^VZ, 


m=l 


m=2 


drift 
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where the last term comes from the fact that the covariance of Z and V is 
—a^VZ. Collecting the terms with /3, and changing variables A: = m — 1 in 
the second sum we get 


n—1 


.m=2 


= pv 

Adding the third sum to the hrst 
= /3V 


^ (m - 1 )Z™ -Y,kZ^ 


k=l 


m=l 


n n—1 

^ mZ^ - ^ kZ’^ 

.m=l k=l 

/ n+1 n \ 

n(3VZ^ = n/3 ^ Z™” V - ^ V 


\m=l 


m=l 


which corresponds to jumps from n to n +1 at rate /3n. Collecting the terms 
with and changing variables to have Z^~^, we get 


= uV 


n—l 

E 


kik — 1 ) r^k-i (k — 1 )(^ “ 2 ) k-i 






_fc=2 k=3 k=2 

Moving terms A: = 2 to n — 1 from the last sum into the first one 

= ~ ~ Z^-^ - E ~ ~ Z’^-^ - (n - 1 )Z 

.k=2 k=3 

The k = 2 term in the hrst sum vanishes. Terms 3 to n — 1 in the hrst sum 
cancel with the second sum leaving 

'n—1 n \ 

z™-V - Y ) 

\m=l m=l / 

which corresponds to jumps n to n — 1 at rate a^n{n — l)/2. 

Combining our calculations 

d 


2 ,^n(n- 1 ) 

= —a V -Z = (7 - 


■jrEYn = ^ Qm,nEYn. 


dt 

m 

where Qm,n is given in (61). Using Lemma 6 again, 

/N{T) 


/n( 0 ) \ /N(T) 

E Y Z^-^{T)V{T) = S X] ^™“^(0)f^(0) 1 . 

\m=l / \m=l 
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8.1. Duality for the Wright-Fisher SPDE. We begin by proving the du¬ 
ality result for the equation for single SPDE (2). This is a known result due 
to Shiga [44]. However, he did not give many details and we need to general¬ 
ize his result to our coupled SPDE, so we will follow the approach of Athreya 
and Tribe [Ij. Let z = 1 — u. Define zt{x) = f zt(y)p^(y — x} dy where is the 
normal density with mean 0 and variance e. Noting that x —>■ zt{x) is smooth 
and using the weak formulation of (2) with test function = p^{y — x), 

we have 


zt{x) — zo{x) = / aAzs{x) ds 

Jo 

( 64 ) j ~~ 

+ J J \/ ^JZsiy){l - zs{y))p^{y - x) dW. 


Using Ito’s formula (each zt{x) is a semi-martingale so this is legitimate) 
and writing Cz for the generator of {zt{xi )^..., z{xn)) with xi ,.. .Xn hxed, 
we see that (ignoring the martingale terms) 


drift ( CzWzt{xi) I = '^W_zt{xj)aAzt{xi) 
\ i / i=l j^i 

(65) 


n « 

+ 2 e/ 3 '^Ylztixj) / [z^{y) - Zt{y)]p^{y - Xi) dy 
i=i j^i J 

n—1 n « 

-i- 47 SE n Zt{xk) J [zt(?/)(l - zt{y)W{y - Xi)p^{y - Xj) dy. 


i=l j=i+l k^i,j 


The dual process is a system of branching coalescing Brownian particles. 
During their lifetime the particles are Brownian motions run at rate 2a 
with each giving birth at rate 20/3. In addition, for i < j, particle j is killed 
by particle i at rate where denotes the local time of the process 
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Xj — Xj at 0. Writing for the generator, we have 


drift I Cx'^zti 


Xi) = zt{xj)aAzti3 

i=l j^i 


(66) +26'/3^JJzt(xj) • [zl{xi) - zt{xi)] 

i=l j^i 
n—1 n 

+ 47 EE n Ztixk) ■ [ztixi)il - Ztixj))] Xi} 

i=l j=i+l kj^iJ 

in which we used the formal notation dL]’^ = 5{x^.(t)=xi(t)}The precise 
meaning of the last term involves integration w.r.t. local times and is ex¬ 
plained in (70). 

We now follow Proposition 1 in [1] to use the duality method encapsulated 
in Theorem 4.4.11 of Ethier and Kurtz [21]. In their notation a = /3 = 0. 

n 

F{z,x) = Y[z{xi) if X = (xi, • • • , x„). 

i=l 

They suppose F{zt, x) — /g G{zs,x) ds and F{z, x{t)) — H{z, x(s)) ds are 
martingales and conclude that for t > 0, 

(67) EF{zt,x{0))-EF{zo,x{t)) =E f G{zt-s,x{s)) - H{zt-s,x{s)) ds. 

Jo 

In our situation, (67) holds with G{z, x) = CzF{z, x) and H{z, x) = CxF{z, x). 

By the continuity of x i-A zt(x), we have F(zt,x(0)) F(zt,x(0)) and 

F(zo,x(t)) — F(zo,x(t)) a.s. as e —>■ 0, so using the bounded convergence 
theorem, 

EF(zt,x(0)) - EF(zo,x(t)) EF(zt,x(0)) - EF(zo, x(t)). 

To prove the desired duality formula 

n(0) n(t) 

( 68 ) E]^^;t(xi( 0 )) =Ej^zo(a:j(t)), t > 0 , 

i=l i=l 

it remains to argue that the RHS of (67) tends to zero as e ^ 0. 

The first term in (65) agrees with that of (66). For the second terms, 
omitting 20/3, the integrand of the difference is 

(69) 


e/ ^ Jj2;t_s(xj(s)) zf_^{y)p^{y - Xi{s))dy - z^_,{xi{s)) 




ds ^ 0 
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a.s. for s € (0, i), by continuity of y i—)• Zs(y) and dominated convergence. 
The contribution to (67) from the third term of (66) is (omitting dy) 

,n(s)-l n(s) 

(70) E / ^ ^ Yl Zt-sixkis)) ■[zt-sixiis)){l-zt-sixjis)))]dV^^ 

which converges, by dominated convergence to 

.n(s)-l n(s) 

(71) E / Y Yl zt-sixkis))-[zt-sixiis)){l-zt-sixjis)))]dLl'\ 

0 i=l j=i+l k^i,j 

Finally, we consider the contribution to (67) from the third term of (65). 
After the substitution y t-A y + Xi(s), we have (omitting dy) 

,n(s)-l n(s) 

(72) ^ yZ p^iy)p"iy + ^ii^)-^ji^))^stiy)dyds, 

do 

where for any i, j, 

Ylj{y)= zt-sixkis)) ■ [zt-s{y + Xi{s)){l - zt-siy + Xi{s)))]. 

At this point we would like to apply Lemma 2 of [1], to obtain 

(73) [ p^iy + Xi{s)-Xj{s))Y^’J{y)ds= I [ p^y + z)Y^’J(y) dz, 

Jo J Jo 

where denotes the local time of the process Xj — Xi at z. Their formula 
asssumes Y is predictable, so we substitute s— for s and note that this does 
not change the integral in (72). 

Putting (73) into (72), then using the continuity of the local time and 
that of T^*’/(y) (see details in pages 172d-1725 of [1]), the integrand of (72) 
converges a.s. to the integrand of (71). Convergence of expectations then 
follows from dominated convergence and the proof of (68) is complete. 

8.2. Duality for the coupled SPDE. To prove the duality formula for the 
coupled SPDE, we order the particles in Xj(t), i < n{t) in such a way that (i) 
when two particles coalesce we keep the smaller index and (ii) immediately 
after particle i gives birth, its offspring has index i and all particles with 
index > i (including the one that just gave birth) increase their index by 
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1. Adding the number of particles in the dual as another variable to help 
clarify things, for A; > 1 we let 


(74) Fk{{z,£),{x,n)) = ^ 2 :(xi). 


i<jl<j 2 <---<jk<n 


The following duality formula is motivated by (4): 


(75) EFk{{zt,it),{xo,no)) ='EFki{zo,io),{xt,nt)), k > 1. 


Once we have shown (75), the uniqueness claimed in Lemma 2 follows, 
since it allows us to conclude that the distribution at a fixed time is unique by 
identifying all cross moments of the bounded functions u and i. Uniqueness 
of the law of the process {u,i) then follows from Theorem 4.2 in Chapter 
4 of Ethier and Kurtz [21], The duality function (68) for the Wright-Fisher 
SPDE can be viewed as the case A: = 0. We shall provide the details of proof 
of (75) for the case A: = 1, the other cases are similar. 

We let £t{x) = / ^t{y)p^{y — x) dy and note 



Writing for the generator of {zt{xi),..., z{xk-i),i{xk)) with xi,...Xn 
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and n fixed, dT:\it[C^ it)■, 


n k—1 k—1 


(76) 


aAztixi) • itixk) + zt{xj)aAit{xk) 


k=2 i=l l<3<k 

n k—1 

j=i 

r 

(77) 

k=2 i=l j^i 

n k—1 

■ ^t{xk) / [zt{y) - Zt{y)]p\y - x*) dy 

(78) 

k=l j=l 

n k—lj—1 

ztiy)itiy)p^iy - Xk) dy 

(79) 

n 

ZtiXh) ■ it{Xk) 


k=3 j=2 i=l l<h<k- 

h^i,j 

1 


J 

[[zt{y)i'i- - Ztiy))]p‘'{y - Xi)p\y - Xj) dy 

(80) 

n k—1 « 

n Zt{xh) l[zt{y)it{y)]p''{y-Xi)p^{y-Xk)dy. 

1 _ o • _ 1 i<r'fc<r't_i J 


k=2 i=l i<h<k-i 
h^i 


Writing Cx^n for the generator of the dual ordered particle system, we 
want to compute 

(81) drift[/la;,„Fi((z,l), (x,n))]. 

and to show, as in (67), that 

(82) ^ 

E [ C^iFi{{zt-s,it-s), {x{s),n{s)))-Cx,nFi{{zt-s,it-s), {x{s),n{s)))ds 0. 

JQ 

The terms coming from particle Brownian motions are 


(83) 


n 


E 



n 


k-l \ 

zt{xj)aMt{xi) ■ it{xk) + zt{xi) ■ Mt{xk) 

) 


which agree with (76), and hence cancel in (82). 

Birth terms. Given a vector x = (xi,... x„), let x® = (xi,... x^, Xj,... Xn) 
be the vector of length n + 1 with the i coordinate duplicated. The total 
change in the drift (81) due to births is (omitting the 20/3) 


E 


n+1 fc—1 n k—1 

Wzt{x])it{xl) - ^ Wzt{Xj)it{Xk) 


k=i-\-l j=l 


k=i-\-l j=l 


(84) 
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Here i is the location of the duplication and there is no change in the drift 
for terms with k < i. The difference 

n+1 n n k—1 

(85) ~ n ^t{xj)it{xk) 

k=i-\-2 k=i-\-l k=i-\-l j=l 

since when fe > i + 2, we have x^^-^ = and zt(xi) appears twice in the 
product. After we interchange the order of the summation this agrees with 
(77) if we replace p^{y — Xi) in that formula by a pointmass at x*. The 
remaining term (A: = i + 1) in the first sum in (84) is 

n i 

( 86 ) 

i=l j=l 

since x] = xl_^_i = Xi. This agrees with (78) if we again replace p‘^{y — Xi) by 
a pointmass at Xj. As we argued in (69) it follows that 

E [ (77) - (85) + (78) - ( 86 ) ds ^ 0 
Jo 


Killing terms. Given a vector x = (xi,... Xn) let x^ = (xi,... Xj-i,Xj+i,... x„) 
be the vector of length n— 1 with the j coordinate removed. The total change 
in the drift (81) due to deaths is (omitting the dy) 


(87) 


n J-1 

EE 


j=2 i=l 


n—1k—1 n k—1 

n n Zt{Xh)it{Xk) 


k=j h=l 


k=j h=l 


^{xj=Xi}- 


Here j is the location of the deletion and there is no change in the drift for 
terms with k < j. When j < k, xj^ = Xk+i so we have 


n—1 n 


E- E 

k=j k=j+l 


n 

E n ZtiXh)[l - ZtiXj)]itiXk). 

k=j+l l<h<k-l 
h^j 


The remaining term coming from k = j in the second sum is 


- ztixh)itixj)- 

h=l 
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Using these results in (87) and noting that in the first case j = n is impos¬ 
sible, we have 


n—lj—1 n 

EE E n Zt(Xh} ■ ^t(Xk) ■ Zt{Xi)[l - Ztixj)] (5{xj=Xi} 

j=2 i=l k=j+l I<h<k-1 
n j—l 

j=2 i=l l<h<j—l^h^i 

where in the second sum we have changed j to k, and in both terms we 
have removed an additional term from the product over h. On other hand, 
interchanging the order of summation in (79) 

n k—1 j — 1 n—l n j—1 n—l j—1 n 

EEE = E E E = EE E 

k=3 j=2 1=1 j=2 k=j+l 1=1 j=2 1=1 k=j+l 

Hence formulas (79) and (80) become 

n—lj—1 n 

1212 n zt{xh) ■ it{xk) 

j=2 i=l k=j+l l<h<k-l 

(89) J [zt{y){l - zt{y))Y{y - Xi)p%y - Xj) dy 

n k—1 « 

-i 2 '^^i 2 n Zt{xh) / [zt{y)(-t{yW{y - Xi)p^{y - Xk) dy. 

k=2 i=l I<h<k-1 J 

Arguing as in (70)-(73) now completes the proof of (82). The proof of (75) 
is complete, and hence that of Lemma 2. 
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